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Abstract 

This work studies the joint design of precoding and backhaul compression strategies for the downlink 
of cloud radio access networks. In these systems, a central encoder is connected to multiple multi- 
c*2 ' antenna base stations (BSs) via finite-capacity backhaul links. At the central encoder, precoding is 

followed by compression in order to produce the rate-limited bit streams delivered to each BS over the 
corresponding backhaul link. In current state-of-the-art approaches, the signals intended for different BSs 
0^ ' are compressed independently. In contrast, this work proposes to leverage joint compression, also referred 



CO 



to as multivariate compression, of the signals of different BSs in order to better control the effect of the 
additive quantization noises at the mobile stations (MSs). The problem of maximizing the weighted sum- 
rate with respect to both the precoding matrix and the joint correlation matrix of the quantization noises 

CO ■ 

is formulated subject to power and backhaul capacity constraints. An iterative algorithm is proposed that 
achieves a stationary point of the problem. Moreover, in order to enable the practical implementation 

of multivariate compression across BSs, a novel architecture is proposed based on successive steps of 

5h ■ 

minimum mean-squared error (MMSE) estimation and per-BS compression. Robust design with respect 

to imperfect channel state information is also discussed. From numerical results, it is confirmed that 

the proposed joint precoding and compression strategy outperforms conventional approaches based on 

the separate design of precoding and compression or independent compression across the BSs. 
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I. Introduction 

Cellular systems are evolving into heterogeneous networks consisting of distributed base 
stations (BSs) covering overlapping areas of different sizes, and thus the problems of interference 
management and cell association are becoming complicated and challenging [1]. One of the 
most promising solutions to these problems is given by so called cloud radio access networks, 
in which the encoding/decoding functionalities of the BSs are migrated to a central unit. This 
is done by operating the BSs as "soft" relays that interface with the central unit via backhaul 
links used to carry only baseband signals (and not "hard" data information) BH-JHl. Cloud radio 
access networks are expected not only to effectively handle the inter-cell interference but also 
to lower system cost related to the deployment and management of the BSs. However, one of 
the main impairments to the implementation of cloud radio access networks is given by the 
capacity limitations of the digital backhaul links connecting the BSs and the central unit. These 
limitations are especially pronounced for pico/femto-BSs, whose connectivity is often afforded 
by last-mile cables Q]||[9]|, an( ^ f° r BSs using wireless backhaul links [TTOl . 

In the uplink of cloud radio access networks, each BS compresses its received signal to the 
central unit via its finite-capacity backhaul link. The central unit thenperforms joint decoding of 
all the mobile stations (MSs) based on all received compressed signals^. Recent theoretical results 
have shown that distributed compression schemes 021 can provide significant advantages over the 
conventional approach based on independent compression at the BSs. This is because the signals 
received by different BSs are statistically correlated [jT3l - [fT71 . and hence distributed source 
coding enables the quality of the compressed signal received from one BS to be improved by 
leveraging the signals received from the other BSs as side information. Note that the correlation 
among the signals received by the BSs is particularly pronounced for systems with many small 
cells concentrated in given areas. While current implementations lf3l lfT8l employ conventional 

'in fact, joint decompression and decoding, an approach that is now often seen as an instance of noisy network coding 11 II , 
is generally advantageous [13]. 
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independent compression across the BSs, the advantages of distributed source coding were first 
demonstrated in |[T3l . and then studied in more general settings in [[T4ll - lfT7l . Related works based 
on the idea of computing a function of the transmitted codewords at the BSs, also known as 
compute-and-forward, can be found in [[T9l 11201 . 

In the downlink of cloud radio access networks, the central encoder performs joint encoding 
of the messages intended for the MSs. Then, it independently compresses the produced baseband 
signals to be transmitted by each BS. These baseband signals are delivered via the backhaul links 
to the corresponding BSs, which simply upconvert and transmit them through their antennas. This 
system was studied in [|2T | [[22 l . In particular, in (l2T|. the central encoder performs dirty -paper 
coding (DPC) E3l of all MSs' signals before compression. A similar approach was studied 
in j[22| by accounting for the effect of imperfect channel state information (CSI). Reference 
|[24| instead proposes strategies based on compute-and-forward, showing advantages in the low- 
backhaul capacity regime and high sensitivity of the performance to the channel parameters. For 
a review of more conventional strategies in which the backhaul links are used to convey message 
information, rather than the compressed baseband signals, we refer to [|25l - ll28l . 

A. Contributions 

In this work, we propose a novel approach for the compression on the backhaul links of cloud 
radio access networks in the downlink that can be seen as the counterpart of the distributed 
source coding strategy studied in |fl4l- lfi7l for the uplink. Moreover, we propose the joint design 
of precoding and compression. A key idea is that of allowing the quantization noise signals 
corresponding to different BSs to be correlated with each other. The motivation behind this 
choice is the fact that a proper design of the correlation of the quantization noises across the 
BSs can be beneficial in limiting the effect of the resulting quantization noise seen at the MSs. 
In order to create such correlation, we propose to jointly compress the baseband signals to be 
delivered over the backhaul links using so called multivariate compression |fl2l Ch. 9]. We also 
show that, in practice, multivariate compression can be implemented without resorting to joint 
compression across all BSs, but using instead a successive compression strategy based on a 
sequence of Minimum Mean Squared Error (MMSE) estimation and per-BS compression steps. 

After reviewing some preliminaries on multivariate compression in Sec. [Till we formulate the 
problem of jointly optimizing the precoding matrix and the correlation matrix of the quantization 
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noises with the aim of maximizing the weighted sum-rate subject to power and the backhaul 
constraints resulting from multivariate compression in Sec. |IVj There, we also introduce the 
proposed architecture based on successive per-BS steps. We then provide an iterative algorithm 
that achieves a stationary point of the problem in Sec. |V] Moreover, we compare the proposed 
joint design with the more conventional approaches based on independent backhaul compression 
[|2T | -[|24 | or on the separate design of precoding and (multivariate) quantization in Sec. |VH The 
robust design with respect to imperfect CSI is also discussed in detail. In Sec. IVIIl extensive 
numerical results are provided to illustrate the advantages offered by the proposed approach. 
The paper is terminated with the conclusion in Sec. I VIII I 

Notation: We adopt standard information-theoretic definitions for the mutual information 
I(X;Y) between the random variables X and Y, conditional mutual information I(X;Y\Z) 
between X and Y conditioned on random variable Z, differential entropy h(X) of X and 
conditional differential entropy h(X\Y) of X conditioned on Y |[12]|. The distribution of a 
random variable X is denoted by p(x) and the conditional distribution of X conditioned on 
Y is represented by p(x\y). All logarithms are in base two unless specified. The circularly 
symmetric complex Gaussian distribution with mean /i, and covariance matrix R is denoted by 
£A/"(/a, R). The set of all M x N complex matrices is denoted by C MxN , and E(-) represents 
the expectation operator. We use the notations X y and X >~ to indicate that the matrix X is 
positive semidefinite and positive definite, respectively. Given a sequence X\, . . . , X m , we define 
a set X s = {Xj\j E S} for a subset S C {1, . . . , to}. The operation (-)t denotes Hermitian 
transpose of a matrix or vector, and notation S x is used for the correlation matrix of random 
vector x, i.e., E x = E[xxt]; S Xjy represents the cross-correlation matrix £ Xjy = E[xyt]; S x | y 
is used for the conditional correlation matrix, i.e., £ x | y = E[xx^|y]. 

II. System Model 

We consider the downlink of a cloud radio access network as illustrated in Fig. [Q In the 
system, a central encoder communicates to N M MSs through N B distributed BSs. The message 
M k for each fcth MS is uniformly distributed in the set {1, . . . , 2 nRk }, where n is blocklength 
and Rk is the information rate of message M& in bits per channel use (c.u.). Each MS k has 
n Mk receive antennas for k = 1, . . . ,N M , and each BS i is equipped with n Bi antennas for 
i — 1, . . . , N B . Note that the BSs can be either macro-BSs or pico/femto-BSs and that the MSs 
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Figure 1. Downlink communication in a cloud radio access network in which there are Nb multi-antenna BSs and Nm 
multi-antenna MSs. The Nb BSs include both macro-BSs and pico/femto-BSs. The Nm MSs are distributed across all the cells. 



are arbitrarily distributed across the cells. Each zth BS is connected to the central encoder via 
digital backhaul link with finite-capacity Ci bits per c.u. For notational convenience, we define 
n B = J2i=i n B,i as the total number of transmitting antennas, n M = J2k=i n M,k as the total 
number of receive antennas, and the sets Mb = {lj • • • , Nb} and Mm — {!)•••> Nm}- 

As shown in Fig. [31 each message M k is first encoded by a separate channel encoder, which 
produces a coded signal s fc . The signal s k E C r ' feXl corresponds to the r k x 1 vector of encoded 
symbols intended for the A;th MS for a given c.u., and we have r k < riM,k- We assume that each 
coded symbol s fc is taken from a conventional Gaussian codebook so that we have s k ~ CM(0, 1). 
The signals s l5 ...,Sjv M are further processed by the central encoder in two stages, namely 
precoding and compression. As is standard practice, precoding is used in order to control the 
interference between the data streams intended for the same MS and for different MSs. Instead, 
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compression is needed in order to produce the N B rate-limited bit streams delivered to each 
BS over the corresponding backhaul link. Specifically, recall that each BS i receives up to C, 
bits per c.u. on the backhaul link from the central encoder. Further discussion on precoding and 
compression can be found in Sec. [TV] 

On the basis of the bits received on the backhaul links, each BS % produces a vector Xj E 
C nsiXl for each c.u., which is the baseband signal to be transmitted from its ns,i antennas. We 
have the per-BS power constraintsE 

E[|| Xi || 2 ] <P { , foneAf B . (1) 

Assuming flat-fading channels, the signal y k E C lM > h received by MS k is written as 

y k = H fc x + z fc , (2) 

where we have defined the aggregate transmit signal vector x = [x{, . . . , xjy ]\ the additive 
noise z k ~ £/V(0, ijl and the channel matrix H k E C" MfcXns toward MS k as 

Hfc = [H fc)1 H fc2 • • • Hk,N B ] , (3) 

with H fe .j G £, n M,kxn B ,i denoting the channel matrix from BS i to MS k. The channel matrices 
remain constant for the entire coding block duration. We assume that the central encoder has 
information about the global channel matrices H fc for all k E Mm and that each MS k is 
only aware of the channel matrix H fc . The BSs must also be informed about the compression 
codebooks used by the central encoder, as further detailed later. The case of imperfect CSI at 
the central encoder will be discussed in Sec. IV-DI 

Based on the definition given above and assuming single-user detection at each MS, the rates 

R k = I (s fc ; y fc ) (4) 

can be achieved for each MS k E Mm- 

2 The results in this paper can be immediately extended to the case with more general power constraints of the form E[x^0jx] < 
Si for I £ {1, . . . , L}, where the matrix 0; is a non-negative definite matrix (see, e.g., 1291 Sec. II-C]). 
Correlated noise can be easily accommodated by performing whitening at each MS k to obtain ©. 
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Figure 2. Illustration of (a) conventional compression; (b) multivariate compression. 

III. Preliminaries 

This section reviews some basic information-theoretical results concerning multivariate com- 
pression, which will be leveraged in the analysis of the proposed backhaul compression strategy 
in Sec. ITV-Bl 

A. Conventional Compression Problem 

To fix the ideas and the notation, we first consider the conventional compression problem illus- 
trated in Fig. |2r(a). The compression unit compresses a random sequence X n of n independent 
and identically distributed (i.i.d.) samples with distribution p(x) at a rate of R bits per symbol. 
Specifically, the compressor selects a codeword X n within a codebook C of size 2 nR and sends 
the corresponding index, of nR bits, to the decompression unit. At the decompression unit, the 
sequence X n E C indicated by the received index is recovered. Using the standard information- 
theoretic formulation, the compression strategy is specified by a conditional distribution p(x\x), 
which is referred to as the test channel (see, e.g., lfT2l Ch. 2]). For a given test channel, 
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compression consists in selecting a sequence X n G C that is jointly typical] with the sequence 
X n with respect to the given joint distribution p(x,x) = p(x)p(x\x). Compression is hence 
successful if the encoder is able to find a jointly typical sequence X n in the codebook C. A 
classical result in information theory is that this happens with arbitrarily large probability as the 
block length n grows large if the inequality 



I[X;X)<R (5) 

is satisfied E2 Ch. 311551. 



B. Multivariate Compression Problem 

We now review the more general multivariate compression illustrated in Fig. |2]-(b). Here, the 
sequence X n is compressed into M indices with the goal of producing correlated compressed 
versions X", . . . , X M . Each ith index indicates a codeword within a codebook Q of size 2 nRi , 
and is sent to the zth decompression unit for i G {1,...,M}. Each ith decompression unit 
then recovers a sequence X™ G C{ corresponding to the received index. We emphasize that the 
choice of the codewords X™, . . . , X M is done jointly at the compression unit. In particular, the 
specification of the compression strategy is given by a test channel p(xi, . . . , xm\x). This implies 
that the compression unit wishes to find codewords X", . . . , X M that are jointly typical with the 
sequence X n with respect to the given joint distribution p(x, x±, . . . , Xm) = p{x)p{xi, . . . , Xm\x). 
The following lemma provides a sufficient condition for multivariate compression to be successful 
(we refer to [12., Lemma 8.2] for a more precise statement). 

Lemma 1. [12, Ch. 9] Consider an i.i.d. sequence X n and n large enough. Then, there ex- 
ist codebooks C\ , . . . , Cm with rates Ri , . . . , Rm, that have at least one tuple of codewords 
(X™, . . . , X M ) G C\ x . . . x Cm jointly typical with X n with respect to the given joint distri- 
bution p(x, xi, . . . , xm) = p(x)p(xi, . . . , xm\x) with probability arbitrarily close to one, if the 
inequalities 

J2 h (x^ - h (x s \x) <^Ri, for all S C {1, . . . , M} (6) 

i£S j£5 

4 Two sequences X n and Y n are called jointly typical with respect to a distribution p(x,y) if their joint empirical distribution 
(i.e., normalized histogram with step size A — > 0) does not deviate much from p(x,y) (see, e.g., 1121 Ch. 2] for a formal 
definition). 
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are satisfied. 

Proof: See QU Ch. 9] for a proof. ■ 

We observe that, for a given test channel p(xi, . . . , Xm\x), the inequalities © impose joint 
conditions on the rate of all codebooks. This is due to the requirement of finding codewords 
Xi, . . . ,X M that are jointly correlated according to the given test channel p(x 1 , . . . ,xm\%)- 
Also, note that the vector X n may be such that each Xi is itself a vector and that the distortion 
requirements at each decompression unit prescribe that the decompression unit be interested in 
only a subset of entries in this vector. The connection between the multivariate set-up in Fig. 
[2Kb) and the system model under study in Fig. \T\ will be detailed in the next section. 

IV. Proposed Approach and Problem Definition 

In this section, we first propose a novel precoding-compression strategy based on multivariate 
compression for the downlink of a cloud radio access network. We then establish the problem 
definition. Finally, a novel architecture that implements multivariate compression via a sequence 
of MMSE estimation and per-BS compression steps is proposed. 

A. Encoding Operation at the Central Encoder 

As mentioned in the previous section, the operation at the central encoder can be represented 
by the block diagram in Fig. |3] Specifically, after channel encoding, the encoded signals s = 
[s\, . . . , s N ]t undergo precoding and compression, as detailed next. 

1. Precoding: In order to allow for interference management both across the MSs and among 
the data streams for the same MS, the signals in vector s are linearly precoded via multiplication 
of a complex matrix A G C" sX " M . The precoded data can be written as 

x = As, (7) 

where the matrix A can be factorized as 

A = [Ax ■ ■ • A Nm ] , (8) 

with Afc G C nsXnMfc denoting the precoding matrix corresponding to MS k. The precoded data 
x can be written as x = [x{, . . . , xjy ]\ where the signal Sq is the ub,% x 1 precoded vector 
corresponding to the ith BS and given as 

x, = EjAs, (9) 
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Figure 3. Illustration of the operation at the central encoder. 



with the matrix Ej E C riB xns >» having all zero elements except for the rows from (X^-=i n B,j + 1) 
to (5^!=i n B.j) which contain an riB,i x n Bi identity matrix. Note that non-linear precoding using 
DPC techniques can also be considered, as discussed in Remark [3] below. 

2. Compression: Each precoded data stream 5q for i e Mb must be compressed in order to 
allow the central encoder to deliver it to the 2th BS through the backhaul link of capacity C{ bits 
per c.u. Each ith BS then simply forwards the compressed signal Xj obtained from the central 
encoder. Note that this implies that the BSs need not be aware of the channel codebooks and of 
the precoding matrix A used by the central encoder. Instead, they must be informed about the 
quantization codebooks selected by the central encoder. 

Using standard rate-distortion considerations, we adopt a Gaussian test channel to model the 
effect of compression on the backhaul link. In particular, we write the compressed signals x, to 
be transmitted by BS i aso 



Xj — Xj -p Qj 



(10) 



where the compression noise q» is modeled as a complex Gaussian vector distributed as CJ\f(0, Slj,i 

5 The test channel x^ = B^Xi + q^ is seemingly more general than l llOt . but this can be captured by adjusting the matrix A 
in CD- 
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Overall, the vector x = [x{, . . . , xjy ]* of compressed signals for all the BSs is given by 

x = As + q, (11) 

where the compression noise q = [qi,---,q]v ]* is modeled as a complex Gaussian vector 
distributed as q~ CM(0, Q). The compression covariance Ct is given as 



n 



^1,1 ^1,2 

O2 l "2,2 



o L 



N B 



^2,Nk 



^N B ,N E 



(12) 



^N B ,l Qn b ,2 ■ 

where the matrix Ojj is defined as fijj = Efq^q^] and defines the correlation between the quan- 
tization noises of BS i and BS j. Rate-distortion theory guarantees that compression codebooks 
can be found for any given covariance matrix Ci, y under appropriate constraints imposed on 
the backhaul links' capacities. This aspect will be further discussed in Sec. IIH-Bl 

With the described precoding and compression operations, the achievable rate © for MS k 
is computed as 



J(s*;yk) = / fc (A,n) 



(13) 



^logdet(l + H fc (AAt + fi)Hl)-logdet(l + H fc ( J] A,Aj + nJHjJ. 

Remark 1. In the conventional approach studied in [|2Ti - [|24l . the signals Xj corresponding to 
each BS i are compressed independently. This corresponds to setting Q it j = for all i 7^ j in 
(fT2l . A key contribution of this work is the proposal to leverage correlated compression for the 
signals of different BSs in order to better control the effect of the additive quantization noises 
at the MSs. 

Remark 2. The design of the precoding matrix A and of the quantization covariance O can be 
either performed separately, e.g., by using a conventional precoder A such as zero-forcing (ZF) 
or MMSE precoding (see, e.g., [|3T| - [|34l ), or jointly. Both approaches will be investigated in the 
following. 

Remark 3. If non-linear precoding via DPC [23] is deployed at the central encoder with a specific 
encoding permutation 7r DPC : Mm ~^ Nm °f tne MS indices Mm> tne achievable rate i? 7rDPC (A : ) 
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for MS ttdpcO) is given as Rn OFC ( k ) = I (s 7rDPC ( fc ); y7r DP c(fc)l s 7r DP c(i), ■ ■ ■ , s 7r DPC (fc-i)) in lie u of 
© and can be calculated as -R^dpc^) = U D pc(k) (A, ft) with the function f WDPC (k) (A, fl) given 
as 

A DPc(fc) (A, fl) 4 logdet f I + H. DPc(fc) f £ A^pjA^pj + O J H+ Dpc(fe) J (14) 
- logdet I + H^ DPc(fe) J2 A -dpc(0 A I dpc (0 + ° H l PC (fc) 

V \i=k+i J 

Note that the DPC is designed based on the knowledge of the noise levels (including the 
quantization noise) in order to properly select the MMSE scaling factor ll35l . 

B. Multivariate Backhaul Compression 

As explained above, due to the fact that the BSs are connected to the central encoder via 
finite-capacity backhaul links, the precoded signals Xj in © for i E Mb are compressed before 
they are communicated to the BSs using the Gaussian test channels (flOl) . In the conventional 
case in which the compression noise signals related to the different BSs are uncorrected, i.e., 
flij = for all i ^ j E Mb as in 112111 - E4l . the signal x, to be emitted from BS i can be reliably 
communicated from the central encoder to BS % if the condition 



I (XfJ Xj) 



log det (Ej AME t + n M ) - log det (fi M ) < Q (15) 



is satisfied for i E Mb- This follows from standard rate-distortion theoretic arguments (see, 
e.g., [30J and Sec. IIII-AI) . We emphasize that (ITBI) is valid under the assumption that each BS 
i is informed about the quantization codebook used by the central encoder, as defined by the 
co variance matrix Ct i}i . 

In this paper, we instead propose to introduce correlation among the compression noise signals, 
i.e., to set f2y ^ for i ^ j, in order to control the effect of the quantization noise at the 
MSs. As discussed in Sec. Hill introducing correlated quantization noises calls for joint, and 
not independent, compression of the precoded signals of different BSs. As seen, the family 
of compression strategies that produce descriptions with correlated compression noises is often 
referred to as multivariate compression. By choosing the test channel according to (fTTI) (see Sec. 
IIII-BI) . we can leverage Lemma [Q to obtain sufficient conditions for the signal Xj to be reliably 
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delivered to BS i for all i E Mb- In Lemma [21 we use E 5 to denote the matrix obtained by 
stacking the matrices Ej for i E S horizontally. 

Lemma 2. The signals x x , . . . , xjy s obtained via the test channel ([77]) can be reliably transferred 
to the BSs on the backhaul links if the condition 

g s (A,ft) ^J>te) ~ h M*) (16) 

= ^logdet (eJAA+E, + n M ) - logdet (e^E 5 ) < J^Q 

is satisfied for all subsets S C Ab- 

Proof: The proof follows by applying Lemma \T\ by substituting x = As for the signal X 
to be compressed, and x 1; . . . , x Nb for the compressed versions Xi, . . . , X M . ■ 

Comparing (fT3T > with (fT6l ) shows that the introduction of correlation among the quantization 
noises for different BSs leads to additional constraints on the backhaul link capacities. 

C. Weighted Sum-Rate Maximization 

Assuming the operation at the central encoder, BSs and MSs detailed above, we are interested 
in maximizing the weighted sum-rate R sura = Ylk=i w kRk subject to the backhaul constraints 
(fT6l) over the precoding matrix A and the compression noise covariance ft for given weights 
Wk > 0, k E Nm- This problem is formulated as 

N M 

maximize / j Wkfk (A, ft) (17a) 

' ~ fe=i 



s.t. g s (A, ft) < Y^ C h for all S C J\f B , (17b) 

■ies 

tr (EJAAtE, + n M ) < P h for all i E Mb- (17c) 

The condition (|17bl) corresponds to the backhaul constraints due to multivariate compression as 
introduced in Lemma |2] and the condition (117ct imposes the transmit power constraints ©■ It 
is noted that the problem (fTTT) is not easy to solve due to the non-convexity of the objective 
function Ylk=i w kfk (A, ft) in (I17al) and the functions ^5 (A, fi) in (I17bl) with respect to (A, f2). 
In Sec. |Vl we will propose an algorithm to tackle the solution of problem (fT7l . 
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Figure 4. Proposed architecture for multivariate compression based on successive steps of MMSE estimation and per-BS 
compression. 



D. Successive Estimation-Compression Architecture 

In order to obtain correlated quantization noises across BSs using multivariate compression, it 
is in principle necessary to perform joint compression of all the precoded signals Xj corresponding 
to all BSs i for i E Mb ( see Sec. IIII-B|) . If the number of BSs is large, this may easily prove to 
be impractical. Here, we argue that, in practice, joint compression is not necessary and that the 
successive strategy based on MMSE estimation and per-BS compression illustrated in Fig. [4] is 
sufficient. The proposed approach works with a fixed permutation ix : Mb —^ Mb of the BSs' 
indices Mb- 

The central encoder first compresses the signal x^m using the test channel (TTOl) . namely 
x 7r ( 1 ) = x,r(i) + q,r(i), with q 7r ( 1 ) ~ CM(0, ^(1)^(1) )> an d sends the bit stream describing the 
compressed signal x^-m over the backhaul link to BS 7r(l). Then, for any other i e Mb with 
i > 1, the central encoder obtains the compressed signal x 7r ( i ) for BS n(i) in a successive manner 
in the given order n by performing the following steps: 

(a) Estimation: The central encoder obtains the MMSE estimate x^) of x 7r ( i ) given the signal 
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x 7r ( i ) and the previously obtained compressed signals x 7r ( 1 ), . . . ,x 7r ( i _ 1 ). This estimate is given 
by 



X*-(i) = E [x^lu^)] 



(18) 



^^(^u^.jS^^Ujj,), 



where we defined the vector u^) = [xL^, . . . , xL 4 _ 1) ,x[ (i Jt, and the correlation matrices 
S x ,-, u f-, and S u ... are given as 



Sr(i) 5 U 7r(i) 



El« AAtE ^-x + %V:) E L AAt *Mi 



r(iy 



(19) 



and 



S u 



ir(i) 



E f s AAtE,s . l +Os.,s. 1 El AAtE^ 

<~>7T i — l ^TT.Z — 1 ' ^TT,! — 1 l^TT.i — 1 O^i — l /l 



(') 



E T r ,AAtE 5 . , 



El(0 AAtE -W 



(20) 



with fl^x = E^J7Et- for subsets 5,TC J\f B and the set S^ defined as 5^ — {vr(l), • • • , 7r(i)}. 
(Z>) Compression: The central encoder compresses the MMSE estimate x^ to obtain x^ 
using the test channel 

Xff(i) = X vr(j) + qvr(j), (21) 

where the quantization noise q^) is independent of the estimate x^) and distributed as q^) ~ 
CAf(0, £ 



X 7r(i) l X 7r(i) - 



with 



X 7r(i) l X 7r(i) X 7r(i) l U 7r(i) 



^ir(i),*r(*) ^W^.i-l^S^^-i^i-i^fi),^-!- 



(22) 



Note that the first equality in (|22l) follows from the fact that the MMSE estimate x^j) is a suffi- 
cient statistic for the estimation of x 7r ( i ) from u^ (see, e.g., Il3~6l ). Moreover, the compression 
rate /(jcw^x,^)) required by the test channel (12TT ) is given by 

H^(i)',^-K(i)) =h (x 7r (j)) — h (x„-(j) |x w (j)) (23) 

= logdet (E^AA+E^) + n ff(i))ffW J 

- logdet (n ff(0 , ff(i) - ^«,5 7r , i _ 1 05i_iA, i -i n l« J 5 7r , i _ 1 ) • 

To see why the structure in Fig. @]described above realizes multivariate compression, we need 
the following lemma. 
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Lemma 3. The region of the backhaul capacity tuples (Ci, . . . , Cn b ) satisfying the constraints 
di7oD is a contrapolymatroiaj K37\ Def 3.1]. Therefore, it has a corner point for each permutation 
n of the BS indices Mb> and each such corner point is given by the tuple (C^m, • • • , CV(at s )) 
with 

C n (i) =1 (x,r(i);x, x 7r(1 ), . . . ,x ff (j_i)) (24) 

=^(x 7r (j);x 7r (.j)) 

/or i = 1,...,Nb. Moreover, the corner point (C„-(i), . . . , C w (jv s )) i« d24l) w smc/? that the 
constraints A17b\) are satisfied with equality for the subsets S = {%(1)}, {7r(l),7r(2)}, . . . , 
{7r(l),...,7r(JV fl )}. 

Proof: This lemma follows immediately by the definition and properties of contrapolyma- 
troids as summarized in [1371 Def. 3.1]. Moreover, the second equality of (124)) holds due to the 



fact that the MMSE estimate xwj) is a sufficient statistic for the estimation of x^ from u^j) 



(see, e.g., 11361 ). or equivalently from the Markov chain x 7r ( i ) — x^) — u n ^. ■ 

Lemma [3] shows that the region of backhaul capacities that guarantees correct delivery of the 
compressed signals ([TO] ) to the BSs, as identified in Lemma [2] is a type of polyhedron known 
as contrapolymatroid, as exemplified in Fig. |5] A specific feature of contrapolymatroid is that 
the corner points can be easily characterized as in d24l . From standard rate-distortion theory 
arguments, the equality between (l23l and (l24l) implies that the corner point (C^m, • • • , C-k{n b )) 
can be obtained for any permutation it by the successive estimation-compression strategy outlined 
above and illustrated in Fig. |4] 

Overall, the discussion above shows that, for any correlation matrix $7 in (fT2l) . multivariate 
compression is feasible by using the successive estimation-compression architecture in Fig. @]if 
the backhaul capacities satisfy the corner point condition ([24]) for the given BS order n. Note 
that, in general, conditions (1241) are more restrictive than (I17bl) . which allows for any backhaul 
capacities in the contrapolymatroid. This is because the solution to the optimization problem 
(flTT) can be seen by contradiction to lie necessarily on the boundary of the region (I17bl ) but 

6 Let us define M — {1, ...,M} and / : M — > R + with M. being the set of all subsets of M. Then, the polyhedron 
{(xi, . . . ,XM)\^2 isS Xi > f(S), for all S C M} is a contrapolymatroid if the function / satisfies the conditions: (a) 
/(0) = 0; (b) f(S) < f(T) if S C T; (c) f(S) + f(T) < f(S U T) + f(S n T) [37. Def. 3.1]. 
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7(x 2 ;Mi) 



7(x 2 ;x) 




/r(l) = 1, tt(2) = 2 



7T(l) = 2, /r(2) = 1 



/(x^x) /( Xl ;x,x 2 ) 



c 



Figure 5. Illustrative example of the contrapolymatroidal region of backhaul capacities (Ci, C2) satisfying the constraint dl7M 
imposed by multivariate compression for iVs = 2. The two corner points are given by l!24t . 



possibly not on the corner points. Further discussion on this point can be found in Sec. IV-B[ 
where we observe that, in practice, this limitation is not critical. 



V. Joint Design of Precoding and Compression 

In this section, we aim at jointly optimizing the precoding matrix A and the compression 
covariance ft by solving problem (fTTT) . In Sec. |VT1 we will then consider the generally suboptimal 
strategy in which the precoding matrix is fixed according to standard techniques, such as ZF [31 J, 
MMSE [32j or weighted sum-rate maximizing precoding [1331 fl34l by neglecting the compression 
noise, and only the compression noise matrix O is optimized. 

A. MM Algorithm 

As mentioned, the optimization (TTTT) is a non-convex problem. To tackle this issue, we will 
now show how to obtain an efficient scheme that is able to achieve a stationary point of problem 
(flTT) . To this end, we first make a change of variable by defining the variables R*,. = A fc A], 
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for k G Mm- Then, we define the functions f k ({Rj}^{, fl) and g s ({R*,}^, ft) with respect 
to the variables {Rk}k=[ which are obtained by substituting H k = A fc AJ, into the functions 
fk (A, O) and g s (A, O) in problem (fTTT ), respectively, and the transmit power constraint as 
tr fe/£i E lRfcE 4 + UiA < P % for i e J\f B - The so-obtained problem over the variables {R*}j£ 
and Q is still non-convex due to the second term in f k ({R,,} J^, O) and the first term in 
gs ({Rfc}^, fi), which are concave in the variables {R k }^[ an d ^- However, we observe that 
it falls into the class of difference-of-convex (DC) problems [38]. Among various algorithms 
having desirable properties for the solution of DC problems lf38l . we adopt the Majorization 
Minimization (MM) scheme ll38ll . which solves a sequence of convex problems obtained by 
linearizing non-convex parts in the objective function ff, ({Rj}^{, fi) and the constraint function 
#5 ({Rfc}/£i>^)- ^ i s shown that the MM algorithm converges to a stationary point of the 
original non-convex problems (see, e.g., [1341 Theorem 1], [[381 Sec. 1.3.3] and |[39l Theorem 
3]). The proposed algorithm is summarized in Table Algorithm 1 where we define the functions 
^({R? +1) }fil, n (t+1) , {Rf }?», nO) and ^({Rf 1} }fi{, O^ 1 ), {B® }£}, n® 



as 



f k ((R-f 1) >iS, ^ (m) , {Rf >Si, " (t) ) (25) 




T,(t+l) _,_o(t+l) ^ W 



(t+1) + ^ i+1 ) ) h! 



E 



^ +1 Mh£, I + H fc £ RW + flW H+ 



and 



<& ({Rr i} }iS, ^ (m) , {^ffi, ^ w ) (26) 



'N M N M 

^ I E E K +1)E * + <■* . E E K )E * + ni 



-logdetfef^Es 
with the function y(X, Y) given as 



<p(X, Y) ± log det (Y) + ^tr (Y" 1 (X - Y)) . (27) 
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Algorithm 1 MM Algorithm for problem (fTTt 



1. Initialize the matrices {R^ sk=i an( ^ ^^ t0 arbitrary feasible positive semidefinite matrices 
for problem <TTvT > and set t — 1. 

2. Update the matrices {R^ }k=[ an d f^ 4+1 ) as a solution of the following (convex) problem. 

N M 

maximize £ «,*# ({R^jft, fi^ 1 ', {R® }ft, «<«>) (28) 

s.t. g' s ({Rj^}^,n^,{R}'>}jg[,nW) < £c«, for all 5 C Afc, 

ieS 

tr f ^EtR^Ei + ng 1 " 15 < Pi, for alii G jV B . 

3. Go to Step 4 if a convergence criterion is satisfied. Otherwise, set t <— t + 1 and go back to 
Step 2. 

4. Calculate the precoding matrices A k «— VfcD fc ' for k G A0w> where D fc is a diagonal matrix 
whose diagonal elements are the nonzero eigenvalues of H k and the columns of V& are the 
corresponding eigenvectors. 



B. Practical Implementation 

As we have discussed in Sec. [TV] given the solution (A, VL) obtained from the proposed 
algorithm, the central processor should generally perform joint compression in order to obtain 
the signals x» to be transmitted by the BSs. However, as seen in Sec. IIV-DI if the solution is such 
that the comer point conditions (|24l are satisfied for a given permutation n of the BSs' indices, 
then the simpler successive estimation-compression structure of Fig. [4] can be leveraged instead. 
We recall, from Lemma |3] that in order to check whether the conditions ([24]) are satisfied for 
some order ir, it is sufficient to observe which inequalities (I17bl) are satisfied with equality: If 
these inequalities correspond to the subsets S = {n(l)}, {n(l), 7r(2)}, . . . , {vr(l), . . . , n(N B )} 
for a given permutation n, then the given solution corresponds to the corner point (|24|) with 
the given n. In our extensive numerical results, we have consistently found this condition to 
be verified. As a result, in practice, one can implement the compression strategy characterized 
by the calculated covariance Vl by employing the implementation of Fig. |4] with the obtained 
ordering n. 
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C. Independent Quantization 

For reference, it is useful to consider the weighted sum-rate maximization problem with 
independent quantization noises as in [|2T T |-[[24 | . This is formulated as (fTTT ) with the additional 
constraints 

n id = 0, for all i^jE Mb- (29) 



Since the constraints (1291 are affine, the MM algorithm in Table Algorithm 1 is still applicable 
by simply setting to zero matrices flij = for i ^ j as per (l29l . 

D. Robust Design with Imperfect CSI 

So far, we have assumed that the central encoder has information about the global channel 
matrices H^ for k E Mm- In this subsection, we discuss the robust design of the precoding 
matrix A and the compression co variance ft in the presence of uncertainty at the central encoder 
regarding the channel matrices H fc for k E Mm- Specifically, we focus on deterministic worst- 
case optimization under two different uncertainty models, namely singular value uncertainty 
ll40l and ellipsoidal uncertainty models (see [|4T|p2l Sec. 4.1] and references therein). While the 
singular value uncertainty model can be related via appropriate bounds to any normed uncertainty 
on the channel matrices, as discussed in ||4"0l Sec. V], the ellipsoidal uncertainty model is more 
accurate when knowledge of the covariance matrix of the CSI error, due, e.g., to estimation, is 
available [42, Sec. 4.1]. In the following, we briefly discuss both models. 

I) Singular Value Uncertainty Model: Considering multiplicative uncertainty model of [1401 
Sec. II- A], the actual channel matrix H^ toward each MS k is modeled as 

H fc = H fc (I + A fc ), (30) 

where the matrix H fc is the CSI known at the central encoder and the matrix A k E C nsXns 
accounts for the multiplicative uncertainty matrix. The latter is bounded as 

a max (A fc ) < e k < 1, (31) 

where <J max (X) is the largest singular value of matrix X. Then, the problem of interest is to 
maximizing the worst-case weighted sum-rate over all possible uncertainty matrices A k for 
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k G Mm subject to the backhaul capacity (|17b| ) and power constraints (|17c|) , namely 

N M 

maximize min > un. fi. (A, fi) (32a) 

*,«*> {Afc ,, 1311}^ tt 



s.t. $ 5 (A, n) < J] C<, for all 5 C JV B , (32b) 

■ies 

tr (eIAA+E; + n M ) < Pi, for all i G W B . (32c) 



The following lemma offers an equivalent formulation for problem (1321) . 

Lemma 4. The problem 021) is equivalent to the problem f fTTI) w?Y/z £/ze channel matrix H k 
replaced with (1 — efc)Hfc /or fe G A/m- 

Proof: We first observe that the uncertainty matrix A& affects only the corresponding rate 
function /k(A, f2) in ([T3l . Therefore, the minimization versus matrices A fc for k G A(m m (I32ab 
can be performed separately for each k by solving the problem min Afc /^(A, f2). It can be now 
easily seen, following [|40l Theorem 1], that the result of this minimization is obtained when 
Afc is such that A& = — e&I. This concludes the proof. ■ 

Based on Lemma HI one can hence solve problem (1321) by using the MM algorithm in Table 
Algorithm 1 with only change of the channel matrices from {H fc } fc= ^ to {(1 — e k )ii k } k ^[. 

2) Ellipsoidal Uncertainty Model: We now consider the ellipsoidal uncertainty model. To 
this end, for simplicity and following related literature (see, e.g., [42, Sec. 4.1]), we focus on 
multiple-input single-output (MISO) case where each MS is equipped with a single antenna, i.e., 
riM,k = 1 for k G Mm- Thus, we denote the channel vector corresponding to each MS k by 
H fc — \r k E C lxns . The actual channel h fe is then modeled as 

h k = h k + e k , (33) 

with hfc and e k being the presumed CSI available at the central encoder and the CSI error, 
respectively. The error vector e k is assumed to be bounded within the ellipsoidal region described 

as 

e[C k e k < 1, (34) 

for k G Mm w i m the matrix C k >- specifying the size and shape of the ellipsoid PHI . 
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Following the standard formulation, we consider here the "dual" problem of power minimiza- 
tion under signal-to-interference-plus-noise ratio (SINR) constraints for all MSs (see [I42I Sec. 
4.1] and references therein). This problem is stated as 



iVr 



'N M 



minimize \_, A** • tr I /_, ^jRfcEj + Qi 



(35a) 



s.t 






hlRthi 



£ ie AkAW h fc R i h * + h l nh * + 1 



> r fe , for all e k with (jMJ) and k e Af, 



M; 



9S 



(A, n) < J2 C i> for a11 S ^ M?, 



(35b) 
(35c) 



ie5 



where the coefficients /^ > are arbitrary weights, T k is the SINR constraint for MS k, and 
we recall that we have R& = A k A\. for k G Mm- The problem (1331 ) is challenging since it 
contains infinite number of constraints in (|35bl ). But, following the conventional S-procedure H3l 
Appendix B.2], we can translate the constraints (|35bl ) into a finite number of linear constraints 
by introducing auxiliary variables (3 k for k G Nm> as discussed in the following lemma. 

Lemma 5. The constraints &35M hold if and only if there exist constants {f3 k > 0}k=[ such that 
the condition 



h fc S fc h fc S fc h fe 



fik 



is satisfied for all k G Hm where we have defined S k 
k eM M - 



C k 
-1 

= Rfc 



r fc z2jeAf M \{k} ±v i 



(36) 
Rj — Tfefi /or 



Proof: It directly follows by applying the S-procedure I1431 Appendix B.2]. ■ 

By transforming the constraint (I35bl ) into (|36T ). we obtain a problem that falls again in the 
class of DC problems [38]. Therefore, one can easily derive the MM algorithm, similar to 
Table Algorithm 1, by linearizing the non-convex terms in the constraint (|35cl) . The algorithm 
is guaranteed to converge to a stationary point of problem (|35l) (see, e.g., [|34l Theorem 1], ll38l 
Sec. 1.3.3] and E3 Theorem 3]). 

VI. Separate Design of Precoding and Compression 

In this section, we discuss a simpler approach in which the precoding matrix A is fixed a 
priori to some standard scheme, such as ZF, MMSE or weighted sum-rate maximizing precoding, 
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by neglecting the compression noise. The compression covariance O is then designed separately 
so as to maximize the weighted sum-rate. 

A. Selection of the Precoding Matrix 

The precoding matrix A is first selected according to some standard criterion [|3TI - [|34l by 
neglecting the compression noise. A subtle issue arises when selecting the precoding matrix 
A that requires some discussion. Specifically, the design of A should be done by assuming a 
reduced power constraint, say 7jPj for some 7, G (0, 1) for i G Mb- The power offset factor 
7i G (0,1) is necessary since the final signal Xj transmitted by each BS i is given by (flOl) and is 
thus the sum of the precoded signal EJAs and the compression noise q^. Therefore, if the power 
of the precoded part EJAs is selected to be equal to the power constraint Pi, the compression 
noise power would be forced to be zero. But this is possible only when the backhaul capacity 
grows to infinity due to (I17bl ). As a result, in order to make the compression feasible, one needs 
to properly select the parameters 71, . . . ,7tv s depending on the backhaul constraints. 

B. Optimization of the Compression Covariance 

Having fixed the precoding matrix A, the problem then reduces to solving problem (fTTb only 
with respect to the compression covariance Q. The obtained problem is thus a DC problem 
which can be tackled via the MM algorithm described in Table Algorithm 1 by limiting the 
optimization at Step 2 only to matrix ft. It is observed that, as discussed above, this problem 
may not be feasible if the parameters 7^ i G Mb, are too large. In practice, one can set these 
parameters using various search strategies such as bisection. 

VII. Numerical Results 

In this section, we present numerical results in order to investigate the advantage of the 
proposed approach based on multivariate compression and on the joint design of precoding and 
compression as compared to the conventional approaches based on independent compression 
across the BSs and separate design. We will focus on the sum-rate performance -R SU m = 
SfceAf R- k ( Le -' we set Wk = 1 m CLTl]))- We also assume that there is one MS active in 
each cell and we consider three cells, so that we have N B = N M = 3. Every BS is subject to 
the same power constraint P and has the same backhaul capacity C, i.e., Pi = P and Ci = C 
for i G Mb- 
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Figure 6. Per-cell sum-rate versus the backhaul capacity C for the circular Wyner model 0441 with P = 20 dB and g = 0.5. 



A. Wyner Model 

We start by considering as a benchmark the performance in a simple circulant Wyner model. 
In this model, all MSs and BSs have a single antenna and the channel matrices H fcj reduce to 
deterministic scalars given as H fc)/fc = 1 for k — 1, 2, 3 and H fc j = g E [0, 1] for j ^ k [|44|. In 
Fig. [61 we compare the proposed scheme with joint design of precoding and compression with 
state-of-the-art techniques, namely the compressed DPC of [12711 . which corresponds to using DPC 
precoding with independent quantization, and reverse Compute-and-Forward (RCoF) [24]. We 
also show the performance with linear precoding for reference. It is observed that multivariate 
compression significantly outperforms the conventional independent compression strategy for 
both linear and DPC precoding. Moreover, RCoF in [|24l remains the most effective approach 
in the regime of moderate backhaul C, although multivariate compression allows to compensate 
for most of the rate loss of standard DPC precoding in the low-backhaul regime^. 



7 The saturation of the rate of RCoF for sufficiently large C is due to the integer constraints imposed on the function of the 
messages to be computed by the MSs |24|. 
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Figure 7. Average sum-rate versus the power offset factor 7 for the separate design of linear precoding and compression in 
Sec. EI with P = 5 dB and a = dB. 



B. General Fading Model 

In this subsection, we evaluate the average sum-rate performance as obtained by averaging 
the sum-rate R sum over the the realization of the fading channel matrices. The elements of the 
channel matrix H fc :i between the MS in the A;th cell and the BS in the ith cell are assumed to be 
i.i.d. complex Gaussian random variables with CA/"(0, a' i_fc ') in which we call a the inter-cell 
channel gain. Moreover, each BS is assumed to use two transmit antennas while each MS is 
equipped with a single receive antenna. In the separate design, we assume that the precoding 
matrix A is obtained via the sum-rate maximization scheme in [33] under the power constraint 
7P for each BS with 7 e (0, 1) selected as discussed in Sec. IVI-Al Note that the algorithm of 
11331 finds a stationary point for the sum-rate maximization problem using the MM approach, 
similar to Table Algorithm 1 without consideration of the quantization noises. 

Fig. U\ demonstrates the impact of the power offset factor 7 on the separate design of linear 
precoding and compression described in Sec. |VI] with P = 5 dB and a = dB. Increasing 7 
means that more power is available at each BS, which generally results in a better sum-rate 
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Figure 8. Average sum-rate versus the transmit power P for linear precoding with C — 2 bit/c.u. and a = dB. 



performance. However, if 7 exceeds some threshold value, the sum-rate is significantly degraded 
since the problem of optimizing the compression covariance O given the precoder A is more 
likely to be infeasible as argued in Sec. I VI- A I This threshold value grows with the backhaul 
capacity, since a larger backhaul capacity allows for a smaller power of the quantization noises. 
Throughout the rest of this section, the power offset factor 7 is optimized via numerical search. 
In Fig. [8] the average sum-rate performance of the linear precoding and compression schemes 
is plotted versus the transmit power P with C = 2 bit/c.u. and a = dB. It is seen that the gain 
of multivariate compression is more pronounced when each BS uses a larger transmit power. 
This implies that, as the received SNR increases, more efficient compression strategies are called 
for. In a similar vein, the importance of the joint design of precoding and compression is more 
significant when the transmit power is larger. Moreover, it is seen that multivariate compression 
is effective in partly compensating for the suboptimality of the separate design. For reference, we 
also plot the cutset bound which is obtained as mm{f? fu n, 3C} where i? full is the sum-capacity 
achievable when the BSs can fully cooperate under per-BS power constraints. We have obtained 
the rate R iu \\ by using the inner-outer iteration algorithm proposed in [45, Sec. II]. It is worth 
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Figure 9. Average sum-rate versus the transmit power P for the joint design in Sec. [V] with C — 2 bit/c.u. and a = dB. 

noting that only the proposed joint design with multivariate compression approaches the cutset 
bound as the transmit power increases. 

In Fig. [9l we compare two precoding methods, DPC and linear precoding, by plotting the 
average sum-rate versus the transmit power P for the joint design in Sec. |V]with C = 2 bit/c.u. 
and a = dB. For DPC, we have applied Algorithm 1 with a proper modification for all 
permutations 7r DPC of MSs' indices Mm an d took the largest sum-rate. Unlike the conventional 
broadcast channels with perfect backhaul links where there exists constant sum-rate gap between 
DPC and linear precoding at high SNR (see, e.g., Il46l0 . Fig. |9] shows that DPC is advantageous 
only in the regime of intermediate P due to the limited-capacity backhaul links. This implies 
that the overall performance is determined by the compression strategy rather than precoding 
method when the backhaul capacity is limited at high SNR. 

Fig. HO] plots the average sum-rate versus the backhaul capacity C for linear precoding with 
P = 5 dB and a = dB. It is observed that when the backhaul links have enough capacity, 
the benefits of multivariate compression or joint design of precoding and compression become 
negligible since the overall performance becomes limited by the sum-capacity achievable when 
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Figure 10. Average sum-rate versus the backhaul capacity C for linear precoding with P = 5 dB and a = dB. 

the BSs are able to fully cooperate with each other. It is also notable that the separate design 
with multivariate compression outperforms the joint design with independent quantization for 
backhaul capacities larger than 5 bit/c.u. 

Finally, we plot the sum-rate versus the inter-cell channel gain a for linear precoding with 
C = 2 bit/c.u. and P = 5 dB in Fig. \TT\ We note that the multi-cell system under consideration 
approaches the system consisting of N B parallel single-cell networks as the inter-cell channel 
gain a decreases. Thus, the advantage of multivariate compression is not significant for small 
values of a, since introducing correlation of the quantization noises across BSs is helpful only 
when each MS suffers from a superposition of quantization noises emitted from multiple BSs. 

VIII. Conclusions 

In this work, we have studied the design of joint precoding and compression strategies 
for the downlink of cloud radio access networks where the BSs are connected to the central 
encoder via finite-capacity backhaul links. Unlike the conventional approaches where the signals 
corresponding to different BSs are compressed independently, we have proposed to exploit 
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Figure 11. Average sum-rate versus the inter-cell channel gain a for linear precoding with C = 2 bit/c.u. and P — 5 dB. 

multivariate compression of the signals of different BSs in order to control the effect of the 
additive quantization noises at the MSs. The problem of maximizing the weighted sum-rate 
subject to power and backhaul constraints was formulated, and an iterative MM algorithm was 
proposed that achieves a stationary point of the problem. Moreover, we have proposed a novel 
way of implementing multivariate compression that does not require joint compression of all 
the BSs' signals but is based on successive per-BS estimation-compression steps. Robust design 
with imperfect CSI was also discussed. Via numerical results, it was confirmed that the proposed 
approach based on multivariate compression and on joint precoding and compression strategy 
outperforms the conventional approaches based on independent compression and separate design 
of precoding and compression strategies. This is especially true when the transmit power or the 
inter-cell channel gain are large, and when the limitation imposed by the finite-capacity backhaul 
links is significant. 
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